Matthieu Geist And NotFranche-Comté
List of bibliographic references
Number of relevant bibliographic references: 11.Ident. | Authors (with country if any) | Title |
---|---|---|
000D16 | Matthieu Geist [France] ; Bruno Scherrer [France] | Off-policy Learning with Eligibility Traces: A Survey |
001183 | Bruno Scherrer [France] ; Matthieu Geist [France] | Policy Search: Any Local Optimum Enjoys a Global Performance Guarantee |
001445 | Edouard Klein [France] ; Bilal Piot [France] ; Matthieu Geist [France] ; Olivier Pietquin [France] | Classification structurée pourl’apprentissage par renforcement inverse |
001692 | Edouard Klein [France] ; Bilal Piot [France] ; Matthieu Geist [France] ; Olivier Pietquin [France] | Classification structurée pour l'apprentissage par renforcement inverse |
001750 | Matthieu Geist [France] ; Bruno Scherrer [France] | Off-policy Learning with Eligibility Traces: A Survey |
001A68 | Bruno Scherrer [France] ; Mohammad Ghavamzadeh [France] ; Victor Gabillon [France] ; Matthieu Geist [France] | Approximate Modified Policy Iteration |
001B39 | Bruno Scherrer [France] ; Victor Gabillon [France] ; Mohammad Ghavamzadeh [France] ; Matthieu Geist [France] | Approximate Modified Policy Iteration |
002138 | Matthieu Geist [France] ; Bruno Scherrer [France] | l1-penalized projected Bellman residual |
002139 | Bruno Scherrer [France] ; Matthieu Geist [France] | Recursive Least-Squares Learning with Eligibility Traces |
002279 | Bruno Scherrer [France] ; Matthieu Geist [France] | Moindres carrés récursifs pour l'évaluation off-policy d'une politique avec traces d'éligibilité |
002303 | Edouard Klein [France] ; Matthieu Geist [France] ; Olivier Pietquin [France] | Batch, Off-policy and Model-Free Apprenticeship Learning |
This area was generated with Dilib version V0.6.33. |